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Purpose: Whether self-reporting and clinician-rated depression scales correlate well with 
one another when applied to older adults has not been well studied, particularly among Asian 
samples. This study aimed to compare the level of agreement among measurements used in 
assessing major depressive disorder (MDD) among the Thai elderly and the factors associated 
with the differences found. 

Patients and methods: This was a prospective, follow-up study of elderly patients diag- 
nosed with MDD and receiving treatment in Thailand. The Mini International Neuropsychiatric 
Inventory (MINI), 17-item Hamilton Depression Rating Scale (HAMD-17), 30-item Geriatric 
Depression Scale (GDS-30), 32-item Inventory of Interpersonal Problems scale, Revised Experi- 
ence of Close Relationships scale, ten-item Perceived Stress Scale (PSS-10), and Multidimen- 
sional Scale of Perceived Social Support were used. Follow-up assessments were conducted 
after 3, 6, 9, and 12 months. 

Results: Among the 74 patients, the mean age was 68+6.02 years, and 86% had MDD. Regarding 
the level of agreement found between GDS-30 and MINI, Kappa ranged between 0.1 7 and 0.55, 
while for Gwet's AC1 the range was 0.49 to 0.91. The level of agreement was found to be low- 
est at baseline, and increased during follow-up visits. The correlation between HAMD-17 and 
GDS-30 scores was 0.17 (P=0.16) at baseline, then 0.36 to 0.41 in later visits (P<0.01). The 
PSS-10 score was found to be positively correlated with GDS-30 at baseline, and predicted the 
level of disagreement found between the clinicians and patients when reporting on MDD. 
Conclusion: The level of agreement between the GDS, MINI, and HAMD was found to be 
different at baseline when compared to later assessments. Patients who produced a low GDS 
score were given a high rating by the clinicians. An additional self-reporting tool such as the 
PSS-10 could, therefore, be used in such under-reporting circumstances. 
Keywords: late-life depression, measurement, correlation 

Introduction 

Major depressive disorder (MDD) is a mood disorder commonly found among the 
elderly; it is characterized by a loss of pleasure, sadness, sleep disturbance, a poor 
sense of self, guilty feelings and cognitive impairment. It can lead to impaired social, 
occupational, everyday functioning. MDD causes clinically significant distress and 
suicidal behaviors in some people as well. The prevalence of late-life depression var- 
ies, but evidence suggests that MDD rates in community settings may be around 5.5% 
to 5.9%. 1-3 A recent meta-analysis reported that the median prevalence of MDD in 
long-term care facilities is around 10%. 4 
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MDD can be diagnosed using reliable screening tools and 
diagnostic criteria. There are two main types of screening 
instruments used when assessing depression: self-rating and 
clinician-rating instruments. The different measures used when 
screening for or diagnosing MDD may have an influence on its 
prevalence. 5 Self-rating instruments are easy for patients to use 
if they have the ability to understand the content, though as cog- 
nitive impairment is one of the conditions commonly presented 
among elderly people with MDD, there may be some degree 
of uncertainty surrounding the use of a self-rating instrument 
in such cases. Clinician-rated instruments can be used without 
a patient's cooperation; however, the level of interpretation of a 
patient's own feelings and thoughts may be limited. While the 
Geriatric Depression Scale (GDS) is a widely used self-rating 
screening measure, the Hamilton Depression Rating Scale 
(HAMD) is a commonly used clinician-rated instrument, and 
is used to assess depression in the elderly. 6,7 

While discrepancies between self-rating and clinician- 
rated scales have been found at baseline and may have an 
impact on treatment outcomes for adult depression, the infor- 
mation gathered thus far on geriatric depression is limited, 8,9 
with little or no data available regarding this issue among 
elderly populations in Asian countries. The primary aim of 
this study was to investigate the level of agreement between 
the 30-item Geriatric Depression Scale and the diagnoses 
made using Diagnostic and Statistical Manual of Mental 
Disorders, fourth edition (DSM-IV) criteria when trying to 
detect MDD in elderly Thais. The researchers also wanted to 
examine the association between the severity of depressive 
symptoms and stress, social support, interpersonal problems, 
and attachment levels. 

Material and methods 

This research was part of the Thai Study of Affective Disor- 
der (THAISAD) research project, a prospective, 12-month 
follow-up study of Thai people diagnosed with depressive 
disorders and receiving treatment at eleven hospitals across 
Thailand. Ethical approval for this study was provided by the 
Joint Research Ethics Committee of Thailand, and the Ethics 
Committee of the Ministry of Public Health of Thailand. Details 
regarding the methodology used in the THAISAD project and 
the participants' characteristics can be viewed elsewhere. 10 

Participants and procedures 

In total, 74 participants aged 60 years and over were included 
out of the 371 adult participants taking part in the main 
study (Figure 1). All of the participants had been diagnosed 
with MDD, dysthymia, or both (double depression), 



according to DSM-IV criteria using the Mini International 
Neuropsychiatric Interview (MINI), version 5. 11 All partici- 
pants were assessed at baseline (month 0), and after 3, 6, 9, 
and 12 months. 

Only those participants able to give informed consent 
were recruited. All the participants were treated with anti- 
depressants (ie, selective serotonin reuptake inhibitors, 
serotonin norepinephrine reuptake inhibitors), hypnotics 
or anxiolytics, and/or psychotherapy, and then monitored 
over the course of 12 months. The medication or treatment 
choices available were considered by psychiatrists in line 
with standard guidelines and treatment recommendations. 
Participants who had severe medical comorbidities, a history 
of cognitive disorders (mild cognitive disorder and dementia) 
according to the Mini-Mental State Examination-Thai 
2002 (MMSE-Thai 2002) 12 instrument, who were unable 
to understand the researchers' words, and/or had a history 
of psychiatric comorbidity (ie, alcohol dependence, anxiety 
disorder, organic mental disorder, psychotic disorders, or 
bipolar disorders) were excluded from the study. 

The MINI was conducted by research nurses or psy- 
chiatrist investigators (all except PB). The Clinical Global 
Impression scale (CGI) and HAMD instruments were used by 



371 THAISAD participants 
age >18; MDD or dysthymia, or both 





74 elderly 
age >60 








Visit 1, Month 0 
MINI, HAMD-17, TGDS-30 
PSS-10, ECR-r, MSPSS, MMSE 








Visit 2-A, Months 3-9 
MINI, HAMD-17, TGDS-30 

0 




Mr 




Visit 5 (last), Month 12 
MINI, HAMD-17, TGDS-30 
PSS-10, ECR-r, MSPSS, MMSE 





Figure I Study flow chart. 

Notes: *Visit where the participants reached remission for the first time (HAMD- 
17^10); all measures in visit 5 were tested. 

Abbreviations: ECR-r, Revised Experience of Close Relationships; GDS-30, 30- 
item Geriatric Depression Scale; HAMD-17, 17-item Hamilton Depression Rating 
Scale; MDD, major depressive disorder; MINI, Mini International Neuropsychiatric 
Interview; MMSE, Mini-Mental State Examination; MSPSS, Multidimensional Scale of 
Perceived Social Support; PSS-10, Perceived Stress Scale; TGDS-30, Thai 30-item 
Geriatric Depression Scale; THAISAD, Thai Study of Affective Disorder. 
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psychiatrists. All these three measures were utilized among 
all the participants and on every visit. At baseline, upon 
remission, and at the end of the study, all participants were 
assessed using a socio-demographic questionnaire, outcome 
variables, and psychosocial instruments (see the Instruments 
section below). Self-rating measures were completed by the 
participants, on their own, or by having the questions read 
to them by research assistants. 

Instruments 

The clinician-rated measurements used included the Clinical 
Global Impression - Severity scale (CGI-S), a seven-point 
scale that requires clinicians to rate the severity of the condi- 
tion being assessed (ranging from 1 [normal/not at all], to 
7 [extremely ill]). The severity of depression was assessed 
using: i) the 17-item Hamilton Depression Rating Scale 
(HAMD-17), a clinician-rated scale; ii) the Thai Depression 
Inventory, a 20-item, four-rating scale which assesses the 
severity of depressive symptoms among respondents, rang- 
ing from 1 (most severe) to 4 (normal); iii) the Thai GDS, a 
30-item, true-false questionnaire used to assess depressive 
symptoms in participants aged 60 years of age or over; 13 
iv) the 32-item Inventory of Interpersonal Problems scale, 
an instrument which uses a five -point Likert scale to assess 
the severity of interpersonal problems in the participants' 
daily lives; 14 v) the short version of the Revised Experience 
of Close Relationships scale, an 18-item instrument which 
uses a seven-point Likert scale when asking respondents how 
anxious or close they feels toward a partner and other close 
friends/relatives; 15 vi) the ten- item Perceived Stress Scale 
(PSS-10), a ten-item/five-rating scale in which participants 
report on how frequently they feel stressed; 16 and vii) the Mul- 
tidimensional Scale of Perceived Social Support (MSPSS), a 
12-item, seven-point Likert rating scale in which participants 
report on how they feel on the social support they receive. 17 
All Thai versions of the measurements have demonstrated 
good reliability and validity. The psychometric properties 
of the instruments used are described in another published 
article. 10 

MMSE-Thai 2002 is a clinician-rated measure used to 
evaluate a participant's cognitive impairment and level of 
dementia, and was developed for use with the Thai population 
from a model first developed by Folstein et al. 10,12,18 Using 
this instrument, there are three different cut-off scores for 
impairment, depending on the patients' formal education 
levels, these being: 22 (out of 30) for elementary school level, 
17 (out of 30) for below elementary school level, and 14 (out 
of 23) for those who did not go to school, or are illiterate. 



Data analysis 

In this study, descriptive statistics were used in order to 
describe socio-demographic and clinical characteristics of 
the participants. Pearson's correlation was calculated in order 
to establish any association between the GDS and HAMD 
scores. To compare the level of agreement between clinician 
diagnoses using MINI and the GDS cut-off score (>24), 
Gwet 's AC 1 19 and Cohen's Kappa were used, using AgreeStat 
software version 201 1.3 (Advanced Analytics, Gaithersburg, 
MD, USA). 20,21 Univariate analysis was used to find predic- 
tors for the severity of suicidality, with statistical analysis 
conducted using SPSS for Windows, version 17 (IBM Cor- 
poration, Armonk, NY, USA). 

Results 

Among the 74 elderly patients (71.6% female), the mean 
age was 68+6.02 years, and 86% had MDD (Table 1). The 
mean GDS-30 at baseline was 17.84+6.84, while for the 
follow-up evaluations the GDS-30 scores were 1 1.69+7.82, 
11.28+7.90, 9.41+7.42, and 9.89+7.41 after 3, 6, 9, and 
12 months, respectively (Table 2). 

In terms of the level of agreement between the MINI and 
GDS (cut-off score, 25 or more for severe depression) when 
diagnosing for MDD, the Cohen's Kappa values were 0. 1 72, 
0.551, 0.528, 0. 102, and 0.359 at months 0 (baseline), 3, 6, 9, 
and 12, respectively. In accordance with Altman, the values 
produced at baseline and at month 9 were poor (<0.20). 22 
However, when using Gwet's AC1, the values were 0.487, 
0.778, 0.782, 0.841, and 0.907 at months 0 (baseline), 3, 6, 9, 
and 12, respectively. Based on Altman 's criteria, these values 
could be rated as moderate to very good. 22 The Gwet's AC1 
values produced were consistent with the percentage level 
of agreement (Table 3). The findings demonstrate that levels 
of agreement (calculated by both Cohen's Kappa and Gwet's 
AC1) were low at baseline. 

Table I Participants' socio-demographic and clinical characteristics 



Characteristics Value (n=74) 



Sociodemographic characteristics 




Age, mean ± SD (min-max) 


68±6.02 (63-94) 


Female sex, n (%) 


S3 (71.6) 


Education years, mean ± SD (min-max) 


3.37±3.53 (0-12) 


Marital status: n (%) living with spouse 


36 (48.6) 


Clinical characteristics 




MDD, n (%) 


61 (85.9) 


Dysthymia n (%) 


6 (8.5) 


Double depression n (%) 


4 (5.6) 


MMSE, mean ± SD (min-max) 


23.74±4.03 (12-30) 



Abbreviations: max, maximum; MDD, Major depressive disorder; min, minimum; 
MMSE, Mini-Mental State Examination; n, number; SD, standard deviation. 
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Table 1 Severity of depression as rated by the GDS-30 and 
HAMD- 1 7 at different times 



Moaci i roc 


S C O 1*6 S 


GDS-30, mean ± SD 




Baseline 


1 7.84±6.84 


Month 3 


1 1 .69±7.82 


Month 6 


1 1 .28±7.90 


Month 9 


9.41 ±7.42 


i loncn i z 


Q QQ-I-T A 1 


HAMD- 17, mean±SD 




Baseline 


23.56±6.32 


Month 3 


1 1 .08±6.38 


Month 6 


I2.43±6.3S 


Month 9 


7.89±4.93 


Month 12 


10.1 l±6.23 



Abbreviations: GDS-30, 30-item Geriatric Depression Scale; HAMD- 1 7, 17-ltem 
Hamilton Depression Rating Scale; SD, standard deviation. 



Most of the HAMD- 17 and GDS-30 scores were cor- 
related, with Pearson's correlation coefficients being 0.17 
(P=0.15) at baseline, then 0.40 ( J P=<0.001), 0.36 (^=0.002), 
0.41 (,P<0.001), and 0.38 CP=0.001) after 3, 6, 9, and 
12 months, respectively (Table 4). 

When exploring the level of discordance at baseline, it was 
found that the MMSE and PSS instruments were good predic- 
tors of depression (unstandardized coefficient (B) =-0.367, 
standard error [SE] 0. 1 6, 95% confidence interval [CI] =-0.68 1 
to -0.053, i-=0.022; B=0.21 1, SE 0.08, 95% CI =0.049-0.374, 
P=0.01 1). However, nothing was found to be a predictor at the 
3- and 9-month follow-ups, though PSS was close to being a 
predictor at month 6 (P=0.050). Interestingly, MSPSS was a 
predictor at month 12 (B =2. 143, SE 0.82, 95% CI =3.76-6.72, 
^=0.0 10). 



Table 3 Level of agreement between the MINI and GDS 



(cut-off 25) for each visit 


Month 


Diagnosis 


Diagnosis 


Cohen's 


Gwet's 


Percentage 




by MINI 


by GDS 


Kappa 


ACI 


agreement 






Yes 


No 








0 


Yes 


44 


20 


0.172 


0.487 


68 




No 


4 


6 








3 


Yes 


10 


6 


0.551 


0.778 


85 




No 


5 


53 








6 


Yes 


9 


5 


0.528 


0.782 


85 




No 


6 


54 








9 


Yes 


1 


3 


0.102 


0.841 


86 




No 


7 


63 








12 


Yes 


2 


2 


0.359 


0.907 


92 




No 


4 


66 









Abbreviations: MINI, Mini International Neuropsychiatric Inventory; GDS, 
Geriatric Depression Scale. 



Table 4 Correlations between the GDS and HAMD- 1 7 scores 



Measures/ GDS 



visits 



HAMD 


Baseline 


Month 3 


Month 6 


Month 9 


Month 12 


Baseline 


0.168 


0.081 


0.010 


-0.013 


-0.096 


Month 3 


0.033 


0.400** 


0.286* 


0.194 


0.166 


Month 6 


0.083 


0.316** 


0.361** 


0.185 


0.292* 


Month 9 


0.220 


0.291* 


0.374** 


0.410** 


0.348** 


Month 12 


0.179 


0.133 


0.246* 


0.221 


0.384** 



Notes: ^Correlation is significant at the 0.01 level (two-tailed); **correlation is 
significant at the 0.05 level (two-tailed). 

Abbreviations: GDS-30, 30-item Geriatric Depression Scale; HAMD- 1 7, 17-item 
Hamilton Depression Rating Scale. 



Discussion 

To our knowledge, this is the first study to have compared 
use of the self-reporting GDS and clinician-rated interview 
of depression instruments among elderly patients. The reason 
why there was low agreement between the first and fourth 
assessments (month 9) may be due to the fact that the par- 
ticipating patients' cognitive functions had been poor upon 
admission for treatment, due to their depression. As a result, 
they may have responded in an inconsistent way. Uher et al, 
using a large sample, found that self-reporting instruments 
are consistent with clinician-rated methods, and so may be 
used interchangeably; however, the participants in that study 
were not elderly. 8 Patient factors need to be taken into account 
when reporting symptoms, as they may have a tendency 
to under- or over-rate. 23 In this study, the first assessment 
produced the lowest level of agreement between the two 
measures - the self-reporting and clinician-rated instruments; 
thereafter, the level of agreement was higher, except during 
the fourth assessment when Cohen's Kappa was at its low- 
est (in contradiction of Gwet's ACI). This may have been 
due to problems with the formulae used by Cohen's Kappa 
when compared to Gwet's AC Is, rather than the actual level 
of agreement. 19 

It is interesting to note that those factors predicting the 
disagreements varied at each visit. It is conceivable that 
MMSE (or "poor cognitive function") predicted discor- 
dance in the first evaluation, while the perception of stress 
was also a predictor of disagreement, as shown by the fact 
that that when it reduced in the follow-up visits, the level 
of disagreement decreased accordingly. This highlights 
the fact that a clinician's judgment when diagnosing using 
DSM does not necessarily reflect how the patients really 
feel. 24 Factors found to be associated with discordance in 
other studies include a higher age, being male, having lower 
impairment levels, and the severity of the symptoms being 
experienced. In addition personality factors, such as high 
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levels of neuroticism, and low levels of extraversion and 
agreeableness, may be associated with a greater endorsement 
of depressive symptoms. 24,25 

When it comes to a large population-based study, self- 
reporting questionnaires may be used before applying 
clinician-rated tools. The GDS can still be used to screen for 
depression (nowadays, GDS- 15 is widely used); 26 - 27 however, 
as the results here show, it is difficult to rely on the GDS tool 
alone, particular at baseline stage. Therefore, using another 
questionnaire, such as the perceived stress scale, alongside 
GDS might be of use, as it may provide more information 
to clinicians with regard to the possibility of their under- 
reporting symptoms, prompting them to carry out further 
assessments. 

Conclusion 

When reporting on MDD in this study, the level of agreement 
between patients and clinicians was found to be different 
at baseline when compared to later assessment periods. 
Patients who produced a low GDS score were rated as 
highly depressed by the clinicians when using either MINI 
or HAMD-17. Use of an additional self-reporting tool, such 
as the Perceived Stress Scale, may therefore be useful in such 
under-reporting circumstances. 

Limitations 

The sample used for this study was of a modest size, so gen- 
eralizability of the findings may be limited. As this study was 
part of the THAISAD project, which studied 371 adult and 
elderly patients with depressive disorders across eleven sites, 
researchers preferred using HAMD to the Montgomery- 
Asberg Depression Rating Scale (MADRS), as they were 
more familiar with the HAMD instrument. However, it may 
be that MADRS is more suitable for use among the elderly, 
due to the fact that it has fewer physically related questions 
when compared to the HAMD tool. However, it is generally 
accepted that HAMD and MADRS correlate well. 
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